UniTune: Text-Driven Image Editing by Fine Tuning a Diffusion Model on a Single Image

نویسندگان

چکیده

Text-driven image generation methods have shown impressive results recently, allowing casual users to generate high quality images by providing textual descriptions. However, similar capabilities for editing existing are still out of reach. usually need edit masks, struggle with edits that require significant visual changes and cannot easily keep specific details the edited portion. In this paper we make observation image-generation models can be converted image-editing simply fine-tuning them on a single image. We also show initializing stochastic sampler noised version base before sampling interpolating relevant from after further increase operation. Combining these observations, propose UniTune, novel method. UniTune gets as input an arbitrary description, carries while maintaining fidelity does not additional inputs, like masks or sketches, perform multiple same without retraining. test our method using Imagen model in range different use cases. demonstrate it is broadly applicable surprisingly wide expressive operations, including those requiring were previously impossible.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Deep Model for Super-resolution Enhancement from a Single Image

This study presents a method to reconstruct a high-resolution image using a deep convolution neural network. We propose a deep model, entitled Deep Block Super Resolution (DBSR), by fusing the output features of a deep convolutional network and a shallow convolutional network. In this way, our model benefits from high frequency and low frequency features extracted from deep and shallow networks...

متن کامل

a study on insurer solvency by panel data model: the case of iranian insurance market

the aim of this thesis is an approach for assessing insurer’s solvency for iranian insurance companies. we use of economic data with both time series and cross-sectional variation, thus by using the panel data model will survey the insurer solvency.

Develop a motivational model based on organizational image and competence

This study develops a model of employee motivation based on organizational image and competence of managers in the staff of the General Department of Sports and Youth of Isfahan Province. The statistical population of this study consisted of all employees of the General Department of Sports and Youth of Isfahan Province. The number was 327 people, a statistical sample of 149 people (men and wom...

متن کامل

Bubble formation on a single orifice in a gas solid fluidized bed using digital image analysis

Digital Image Analysis (DIA) has been employed to characterize the time evolution of a bubble injected from a single orifice into a pseudo 2-dimansional gas-solid fluidized bed. The injected bubble diameter increased with the square root of time before detachment. During bubble free flight in the bed, its diameter remains approximately constant. The center of mass of the bubble increases with t...

متن کامل

A Novel Unified Variational Image Editing Model

In this paper we propose a unified variational image editing model. It interprets image editing as a variational problem concerning the adaptive adjustments to the zeroand first-derivatives of the images which correspond to the color and gradient items. By varying the definition domain of each of the two items as well as applying diverse operators, the new model is capable of tackling a variety...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: ACM Transactions on Graphics

سال: 2023

ISSN: ['0730-0301', '1557-7368']

DOI: https://doi.org/10.1145/3592451